09/936033

IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 
Applicant : Philippe Bordes and Philippe Guillotel 

Filed : February 29, 2000 - PCT National Phase of PCT/EP00/01688

For : PROCESS FOR EVALUATING CODED IMAGES, DEVICE 

IMPLEMENTING AND PROCESS AND USE OF THE DEVICE AND PROCESS 

PRELIMINARY AMENDMENT 

Hon. Commissioner of Patents and Trademarks 
Box PCT 

Washington, D.C. 20231 
Sir: 

In the US national phase application of PCT/EP00/01688
please enter the following amendments. 

IN THE TITLE: 

Please delete the title and insert the new title as published in the 
PCT international Application - PROCESS, DEVICE AND USE FOR 
EVALUATING CODED IMAGES - 

IN THE SPECIFICATION : 

Please amend the specification as follows: 

Page 1, line 4 after the title, insert the following:

--This application claims the benefit under 35 U.S.C. § 365 of
International Application PCT/EP00/01688, filed February 29, 2000, which
claims the benefit of French Patent Application No. 9902827, filed March 8,
1999.--

IN THE CLAIMS : 

Please amend the claims as follows. This is the clean version. 
Attached is the marked up version of these claims. 

1. Process for evaluating the quality of coded images, wherein it
comprises:

a) a step of processing the signal representative of the image so
as to obtain a processed signal, 

b) a step of constructing on the basis of the signal 
representative of the coded image, a signal representative of the field of 
motion image on the basis of the source sequence, 

c) a step of building a signal representative of the segmenting of 
the field of motion and of storing the image pixels representative of each 
region having a different field of motion at an address defined with respect to 
the velocity vectors estimated in the step of constructing the field of motion 
making it possible to determine the pixels having different velocity vectors, 

d) a step of determining or of calculating a psychovisual human 
filter to be applied as a function of the estimated velocity of the region, 

e) a step of filtering the processed signal, and 

f) a step of constructing the map of disparities between the 
signals representative of the image which are obtained after the filtering step 
and the signals representative of the decoded image which are obtained after 
the filtering step. 

2. Process for evaluating the quality of coded images according to 
Claim 1, wherein it comprises a step consisting in applying each of the
preceding steps to the source image and to the decoded image. 

3. Process for evaluating the quality of coded images according to 
Claim 1, wherein it comprises a step of frequency decomposition of the 
images (FFT, subband, etc.) which precedes the filtering step and consists of 
a weighting by a coefficient deduced from curves taking into account the 
estimated velocity and the frequency band considered, so as to take account 
of the relative influence of the velocity and of the spatial frequency on the 
perception of the moving images. 

4. Process according to Claim 1, wherein the psychovisual filtering step is
applied to matrices representative of the inter-pyramid differences between 
the Laplace pyramids of the processed source images and those of the 
processed decoded images after weighting by, on the one hand, the local 
influence representative of the frequency of the pixel concerned and, on the 
other hand, a filtering coefficient deduced from filtering curves taking into
account the estimated velocity and the frequency band corresponding to the 
level of the Laplace pyramid to which the pixel belongs in a multiresolution 
pyramid obtained by constructing a pyramid on the basis of the image of each 
region of different velocity. 

5. Process according to Claim 1, wherein the psychovisual filtering curves
are either built from a succession of curves arranged in the form of a 
database and stored in the system, and possibly interpolation on the basis of 
these curves, or obtained by analytical representation implemented by 
calculation means making it possible to calculate each curve. 

6. Process according to Claim 4, wherein the step of constructing the 
map of disparities is performed by recomposing the filtered multiresolution 
pyramids obtained in the preceding step.

7. Process according to Claim 4, wherein the step of processing the
image comprises a step of decomposing the source and decoded images into 
a Laplace pyramid of n levels and a step of constructing the inter-pyramid 
difference. 

8. Process according to Claim 1, wherein the velocity or local value of the
motion is obtained by possible construction of filters followed by application of 
the filter constructed or by application of a median filter. 

9. Process according to Claim 1, wherein it comprises a step of
precorrecting the images by performing a Gamma correction and a correction 
by Weber's law. 

10. Process according to Claim 7, wherein the Gamma correction $\gamma$ is as
follows:

$Y = K_s V^{\gamma_s}$ with $V = k_a E^{\gamma_a}$

in which $Y$ is the luminance, $V$ the luminance voltage, $E$ the illumination of the
analysed image, $\gamma_s$ is an exponent of around 2.2 for black and
white picture tubes and $\gamma_a$ has a value of 0.45 commonly agreed for colour
television.

11. Process according to Claim 1, wherein the filtering is obtained by
constructing the psychovisual filter corresponding to the velocity estimated on 
the basis of a database of filters and interpolation between the two filters 
corresponding to the regions closest to the region whose velocity has been 
estimated. 

12. Process according to Claim 4, wherein the relative local influence ($I_n$)
of the pixel $p_{ij}$ concerned is obtained by calculating a value $E_n$ representing
the $q$-th power of the inter-pyramid level-to-level difference between the source
pyramids and decoded pyramids of like level of the pixel concerned.

13. Process according to Claim 12, wherein the calculation of $I_n$ is
performed by using the following formula:

$I_n = \dfrac{E_n}{\sum_{k<n} m(E_k)}$

with $E_n = (\mathrm{Diff}_n(p_{ij}))^q$,

$m(E_k) = E_k$ if $E_k > S$
and $m(E_k) = S$ if $E_k < S$

with for example S = 0.5% (maximum possible value of $E_k$).

14. Process according to Claim 4, wherein the filtering comprises a 
directional filtering of the images in a determined direction rather than in 
another. 

15. Process according to Claim 9, wherein the Gamma correction is
performed by a calculation device implementing the following equation:

$L_{display} = L_{max} \left( \dfrac{e}{e_{max}} \right)^{\gamma}$

e being the grid level value of the pixel, $e_{max}$ being the maximum value,
for example 256 if the coding is performed on 8 bits, $L_{max}$ being the intensity
corresponding to $e_{max}$ in cd/m^2.

16. Process according to Claim 9, wherein Weber's law is implemented by
a calculation device which carries out the following function: 



17. Process according to Claim 1, wherein the calculation of the filter is
obtained through the following formula:

$G(\alpha, v) = [6.1 + 7.3\,|\log(v/3)|^3] \times v\,\alpha^2 \exp[-2\alpha(v + 2)/45.9]$

with $\alpha = 2\pi f$, f = spatial frequency, v = velocity.

18. Use of the process according to Claim 1, in a coding device,
characterized by a dynamic retroaction as a function of the psychovisual
disparities calculated by the calculation device implementing the process on
one of the parameters used by the coding device in the course of the coding.

19. Use of the process according to Claim 18, wherein the calculated 
disparities are compared with a threshold so as to modify the coding 
parameters of the coding apparatus until the desired threshold is 
overstepped. 

20. Use of the process according to Claim 19, wherein one of the
parameters is either the quantization interval, or the size of the images, or the 
form of the group of pictures GOP. 

21. Use of the process according to Claim 18, wherein the homogeneity of
the calculated disparities is analysed by the calculation device so as to act on 
the coding parameters of the coding apparatus. 

22. Use of the process according to Claim 18, wherein the coding
parameters of the different objects of an image whose coding is object 
oriented are modified as a function of a constant desired disparity. 

23. Use of the process according to Claim 18, wherein it consists in
performing a dynamic reallocation of the bit rates allocated to a coding 
apparatus with multiplexing. 

24. Device for evaluating the quality of coded images, wherein it 
comprises: 

- a means of processing the signal representative of the source 
image (10a) and of the decoded image [(10b)] so as to obtain a processed 
source image signal and a processed decoded image signal,

- means of constructing on the basis of the signal
representative of each of the images, a signal representative of the estimating 
of the field of motion on the basis of each of the images of the source and 
decoded sequences, 

- means of building a signal representative of the segmenting of 
the field of motion and of storing the image pixels representative of each 
region Rj having a different field of motion at an address defined with respect
to the velocity vectors estimated in the step of constructing the field of motion 
making it possible to determine for each of the source and decoded images 
those having different velocity vectors, 

- a means of determining or of calculating a psychovisual 
human filter to be applied as a function of the estimated velocity of the region, 

- means of filtering applied to each of the processed source 
images and processed decoded images and 

- a means of constructing the map of disparities between the 
signals representative of the processed source image which are obtained 
after the filtering step and the signals representative of the processed 
decoded image which are obtained after the filtering step. 

25. Device according to Claim 24, wherein the psychovisual filtering means 
are applied to matrices representative of the inter-pyramid differences 
calculated by calculation means between the Laplace pyramids of the 
processed source images and those of the processed decoded images after 
weighting by, on the one hand, the local influence representative of the 
frequency of the pixel concerned and, on the other hand, a filtering coefficient 
deduced from stored or calculated filtering curves and taking into account the 
estimated velocity and the frequency band corresponding to the level of the 
Laplace pyramid to which the pixel belongs in a multiresolution pyramid 
obtained by means of constructing this multiresolution pyramid on the basis of 
the image of each region of different velocity. 

26. Device according to Claim 24, wherein the means of constructing the 
map of disparities perform a recomposition of the filtered multiresolution 
pyramids. 

27. Device according to one of Claim 24, wherein the means of 
processing, the means of building, the means of determining, the means of 
constructing, the means of filtering consist of at least one microprocessor 
associated with memories sufficient to contain the programs making it 
possible to embody the various means and to contain the databases and the 
intermediate information necessary for the calculation and for obtaining the 
map of disparities. 

28. Process according to Claim 1, the images being coded according to
the MPEG standard, wherein the step of constructing a signal representative 
of the field of motion image exploits the per-macroblock motion vectors
calculated during the coding of the images according to the MPEG standard. 

29. Process according to Claim 1, wherein the decoded image is a noisy
source image constructed on the basis of the source image to which white 
noise is added.

30. Use of the process according to Claim 29 for predicting, on the basis of
the map of disparities, the regions most sensitive "a priori" to the coding 
errors and for coding the regions as a function of this prediction. 

31. Use of the process according to Claim 29 to perform a prefiltering of
the source images as a function of the map of disparities. 

32. Use of the process according to Claim 29 for determining locally the 
amount of information which can be inserted into the images (Watermarking)
without this addition being perceptible. 
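
The filter formula of Claim 17 can be illustrated with a minimal Python sketch. It assumes the formula reads G(α, v) = [6.1 + 7.3 |log(v/3)|^3] · v · α² · exp[-2α(v + 2)/45.9] with α = 2πf, that the logarithm is base 10, and that the velocity v is strictly positive; the function name `psychovisual_gain` is hypothetical and is not taken from the application.

```python
import math


def psychovisual_gain(f, v):
    """Sketch of the velocity-dependent filter gain G(alpha, v).

    f: spatial frequency, v: velocity (must be > 0).
    Assumptions: base-10 logarithm, cubic exponent on |log(v/3)|,
    and division of the exponent by 45.9.
    """
    if v <= 0:
        raise ValueError("velocity must be strictly positive")
    alpha = 2.0 * math.pi * f
    k = 6.1 + 7.3 * abs(math.log10(v / 3.0)) ** 3
    return k * v * alpha ** 2 * math.exp(-2.0 * alpha * (v + 2.0) / 45.9)


if __name__ == "__main__":
    # Gain for a few spatial frequencies at two velocities.
    for velocity in (1.0, 8.0):
        print(velocity, [round(psychovisual_gain(f, velocity), 3) for f in (0.5, 1.0, 2.0, 4.0)])
```

Under these assumptions the gain vanishes at zero spatial frequency and decays exponentially at high frequencies, more quickly for fast-moving regions.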



IN THE ABSTRACT: 

Please add the attached Abstract. 



The specification has been amended to include a reference to 
the priority applications. 

The above amendments to the claims have been made to 
eliminate reference indicia and to meet the requirements of the USPTO. 

To meet the requirements of the United States, the Abstract, as 
filed, has been amended. 

No fee is believed to have been incurred by virtue of this 
amendment. However, if a fee is incurred on the basis of this 
amendment, please charge such fee against deposit account 07-0832. 



REMARKS 



Respectfully submitted, 
Philippe Bordes 
Philippe Guillotel 




Ronald H. Kurdyla, Attorney
Registration No. 26,932 
609/734-9701 



THOMSON multimedia Licensing Inc. 
Patent Operation 

PO Box 5312, Princeton, NJ 08543-5312 



Date:

MARKED UP CLAIMS

1. Process for evaluating the quality of coded images, [characterized in
that] wherein it comprises: 

a) a step of processing the signal representative of the image
so as to obtain a processed signal,

b) a step of constructing on the basis of the signal 
representative of the coded image, a signal representative of the field of 
motion image on the basis of the source sequence, 

c) a step of building a signal representative of the segmenting
of the field of motion and of storing the image pixels representative of each
region having a different field of motion at an address defined with respect to 
the velocity vectors estimated in the step of constructing the field of motion 
making it possible to determine the pixels having different velocity vectors, 

d) a step of determining or of calculating a psychovisual human
filter to be applied as a function of the estimated velocity of the region,

e) a step of filtering the processed signal, and 

f) a step of constructing the map of disparities between the 
signals representative of the image which are obtained after the filtering step
and the signals representative of the decoded image which are obtained after
the filtering step. 

2. Process for evaluating the quality of coded images according to 
Claim 1, [characterized in that] wherein it comprises a step consisting in
applying each of the preceding steps to the source image and to the decoded
image. 

3. Process for evaluating the quality of coded images according to 
Claim 1, [characterized in that] wherein it comprises a step of frequency
decomposition of the images (FFT, subband, etc.) which precedes the
filtering step and consists of a weighting by a coefficient deduced from 
curves taking into account the estimated velocity and the frequency band 
considered, so as to take account of the relative influence of the velocity and 
of the spatial frequency on the perception of the moving images. 

4. Process according to Claim 1, [characterized in that] wherein the
psychovisual filtering step is applied to matrices representative of the
inter-pyramid differences between the Laplace pyramids of the processed
source images and those of the processed decoded images after weighting
by, on the one hand, the local influence representative of the frequency of
the pixel concerned and, on the other hand, a filtering coefficient deduced 
from filtering curves taking into account the estimated velocity and the 
frequency band corresponding to the level of the Laplace pyramid to which 
the pixel belongs in a multiresolution pyramid obtained by constructing a
pyramid on the basis of the image of each region of different velocity.

5. Process according to Claim 1, [characterized in that] wherein the
psychovisual filtering curves are either built from a succession of curves
arranged in the form of a database and stored in the system, and possibly
interpolation on the basis of these curves, or obtained by analytical 
representation implemented by calculation means making it possible to 
calculate each curve. 

6. Process according to Claim 4, [characterized in that] wherein the step 
of constructing the map of disparities is performed by recomposing the 
filtered multiresolution pyramids obtained in the preceding step. 

7. Process according to Claim 4 [or 6], [characterized in that] wherein the
step of processing the image comprises a step of decomposing the source 
and decoded images into a Laplace pyramid of n levels and a step of 
constructing the inter-pyramid difference. 

8. Process according to Claim 1, [characterized in that] wherein the
velocity or local value of the motion is obtained by possible construction of 
filters followed by application of the filter constructed or by application of a 
median filter. 

9. Process according to Claim 1 [or 4], [characterized in that] wherein it
comprises a step of precorrecting the images by performing a Gamma 
correction and a correction by Weber's law. 

10. Process according to Claim 7, [characterized in that] wherein the
Gamma correction $\gamma$ is as follows:

$Y = K_s V^{\gamma_s}$ with $V = k_a E^{\gamma_a}$

in which $Y$ is the luminance, $V$ the luminance voltage, $E$ the illumination of the
analysed image, $\gamma_s$ is an exponent of around 2.2 for black and
white picture tubes and $\gamma_a$ has a value of 0.45 commonly agreed for colour
television.

11. Process according to Claim 1, [characterized in that] wherein the
filtering is obtained by constructing the psychovisual filter corresponding to
the velocity estimated on the basis of a database of filters and interpolation
between the two filters corresponding to the regions closest to the region 
whose velocity has been estimated. 

12. Process according to Claim 4, [characterized in that] wherein the
relative local influence ($I_n$) of the pixel $p_{ij}$ concerned is obtained by calculating
a value $E_n$ representing the $q$-th power of the inter-pyramid level-to-level
difference between the source pyramids and decoded pyramids of like level 
of the pixel concerned. 

13. Process according to Claim 12, [characterized in that] wherein the
calculation of $I_n$ is performed by using the following formula:

$I_n = \dfrac{E_n}{\sum_{k<n} m(E_k)}$

with $E_n = (\mathrm{Diff}_n(p_{ij}))^q$,

$m(E_k) = E_k$ if $E_k > S$
and $m(E_k) = S$ if $E_k < S$

with for example S = 0.5% (maximum possible value of $E_k$).

14. Process according to Claim 4, [characterized in that] wherein the 
filtering comprises a directional filtering of the images in a determined
direction rather than in another.

15. Process according to Claim 9, [characterized in that] wherein the
Gamma correction is performed by a calculation device implementing the
following equation:

$L_{display} = L_{max} \left( \dfrac{e}{e_{max}} \right)^{\gamma}$

e being the grid level value of the pixel, $e_{max}$ being the maximum value,
for example 256 if the coding is performed on 8 bits, $L_{max}$ being the intensity
corresponding to $e_{max}$ in cd/m^2.

16. Process according to Claim 9, [characterized in that] wherein Weber's
law is implemented by a calculation device which carries out the following
function: 

$V_{out} = \ldots \log_{10}(1 + 100 \ldots)$



17. Process according to Claim 1 [or 4], [characterized in that] wherein the
calculation of the filter is obtained through the following formula:

$G(\alpha, v) = [6.1 + 7.3\,|\log(v/3)|^3] \times v\,\alpha^2 \exp[-2\alpha(v + 2)/45.9]$

with $\alpha = 2\pi f$, f = spatial frequency, v = velocity.

18. Use of the process according to [one of the preceding claims] Claim 1,
in a coding device, characterized by a dynamic retroaction as a function of 
the psychovisual disparities calculated by the calculation device 
implementing the process on one of the parameters used by the coding 
device in the course of the coding. 

19. Use of the process according to Claim 18, [characterized in that]
wherein the calculated disparities are compared with a threshold so as to 
modify the coding parameters of the coding apparatus until the desired 
threshold is overstepped.

20. Use of the process according to Claim 19, [characterized in that]
wherein one of the parameters is either the quantization interval, or the size
of the images, or the form of the group of pictures GOP. 

21. Use of the process according to Claim 18, [characterized in that]
wherein the homogeneity of the calculated disparities is analysed by the 
calculation device so as to act on the coding parameters of the coding 
apparatus. 

22. Use of the process according to [the preceding claims] Claim 18,
[characterized in that] wherein the coding parameters of the different objects
of an image whose coding is object oriented are modified as a function of a 
constant desired disparity. 

23. Use of the process according to [Claims 18 to 22] Claim 18,
[characterized in that] wherein it consists in performing a dynamic 
reallocation of the bit rates allocated to a coding apparatus with multiplexing. 

24. Device for evaluating the quality of coded images, [characterized in
that] wherein it comprises:

- a means [(1a, 1b)] of processing the signal representative of
the source image (10a) and of the decoded image [(10b)] so as to obtain a 
processed source image signal and a processed decoded image signal, 

- means [(2a, 2b)] of constructing on the basis of the signal
representative of each of the images, a signal representative of the
estimating of the field of motion on the basis of each of the images of the
source and decoded sequences, 

- means [(3a, 3b)] of building a signal representative of the
segmenting of the field of motion and of storing the image pixels
representative of each region Rj having a different field of motion at an
address defined with respect to the velocity vectors estimated in the step of
constructing the field of motion making it possible to determine for each of
the source and decoded images those having different velocity vectors,

- a means [(4, 5)] of determining or of calculating a
psychovisual human filter to be applied as a function of the estimated velocity
of the region,

- means [(6a, 6b)] of filtering applied to each of the processed 
source images and processed decoded images and 

- a means [(7)] of constructing the map of disparities between
the signals representative of the processed source image which are obtained
after the filtering step and the signals representative of the processed
decoded image which are obtained after the filtering step.

25. Device according to Claim 24, [characterized in that] wherein the
psychovisual filtering means are applied to matrices representative of the
inter-pyramid differences calculated by calculation means between the
Laplace pyramids of the processed source images and those of the
processed decoded images after weighting by, on the one hand, the local
influence representative of the frequency of the pixel concerned and, on the
other hand, a filtering coefficient deduced from stored or calculated filtering
curves and taking into account the estimated velocity and the frequency band
corresponding to the level of the Laplace pyramid to which the pixel belongs
in a multiresolution pyramid obtained by means of constructing this
multiresolution pyramid on the basis of the image of each region of different
velocity.

26. Device according to Claim 24, [characterized in that] wherein the
means of constructing the map of disparities perform a recomposition of the
filtered multiresolution pyramids.

27. Device according to one of [Claims 24 to 26] Claim 24, [characterized
in that] wherein the means of processing, the means of building, the means
of determining, the means of constructing, the means of filtering consist of at
least one microprocessor associated with memories sufficient to contain the 
programs making it possible to embody the various means and to contain the 
databases and the intermediate information necessary for the calculation and 
for obtaining the map of disparities. 

28. Process according to Claim 1, the images being coded according to
the MPEG standard, [characterized in that] wherein the step of constructing a
signal representative of the field of motion image exploits the per-macroblock
motion vectors calculated during the coding of the images according to the
MPEG standard.

29. Process according to Claim 1, [characterized in that] wherein the
decoded image is a noisy source image constructed on the basis of the 
source image to which white noise is added. 

30. Use of the process according to Claim 29 for predicting, on the basis 
of the map of disparities, the regions most sensitive "a priori" to the coding 
errors and for coding the regions as a function of this prediction. 

31. Use of the process according to Claim 29 to perform a prefiltering of
the source images as a function of the map of disparities. 

32. Use of the process according to Claim 29 for determining locally the
amount of information which can be inserted into the images (Watermarking)
without this addition being perceptible.
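
The retroaction described in Claims 18 to 20 can be pictured as a simple control loop. The Python sketch below is only one possible reading: it assumes that the coding parameter being adjusted is the quantization interval, that a smaller interval yields smaller disparities, and that a caller-supplied function returns one scalar disparity score per trial. The names `tune_quantization_step` and `measure_disparity` are hypothetical.

```python
def tune_quantization_step(measure_disparity, q_start=31, q_min=1, threshold=0.1):
    """Lower the quantization step until the disparity score meets the threshold.

    measure_disparity: hypothetical callable mapping a quantization step to a
    scalar psychovisual disparity score (coding, decoding and evaluation are
    assumed to happen inside it).
    Returns the first step whose score is <= threshold, or q_min if none is.
    """
    q = q_start
    while q > q_min and measure_disparity(q) > threshold:
        q -= 1  # finer quantization, assumed to reduce the disparities
    return q


if __name__ == "__main__":
    # Toy disparity model used only for demonstration: score grows with the step.
    print(tune_quantization_step(lambda q: q / 100.0))
```

A real coder would replace the toy model with an actual coding pass followed by the construction of the map of disparities.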



PF990005-FOFI 




Rec'd PCT/PTO 07 SEP 2001



WO 00/54220 PCT/EP00/01688
PROCESS, DEVICE AND USE FOR EVALUATING CODED IMAGES 



The present invention relates to a process and a device for
evaluating the quality of coded images as well as to the use of such a
process and a device.

In systems for coding digital video sequences, it is known practice
to estimate the quality of the images output by the procedure by comparison
with the original image, using the signal-to-noise ratio.

This ratio is generally termed the PSNR (Peak Signal to Noise
Ratio) and obtained by summing the quadratic differences of the pixels of the
final image and of the original image.
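
For reference, the PSNR mentioned above can be computed as in the following minimal Python sketch. It uses the usual definition (mean of the squared pixel differences, compared with the squared peak value on a decibel scale); the function name `psnr`, the list-of-rows image representation and the default peak of 255 for 8-bit images are assumptions made for the example.

```python
import math


def psnr(original, decoded, peak=255.0):
    """Peak signal-to-noise ratio in dB between two greyscale images.

    Images are lists of rows of pixel values with identical dimensions.
    Returns math.inf when the images are identical.
    """
    if len(original) != len(decoded) or any(
        len(a) != len(b) for a, b in zip(original, decoded)
    ):
        raise ValueError("images must have the same dimensions")
    squared_error = 0.0
    count = 0
    for row_o, row_d in zip(original, decoded):
        for o, d in zip(row_o, row_d):
            squared_error += (o - d) ** 2
            count += 1
    mse = squared_error / count  # mean of the quadratic differences
    if mse == 0:
        return math.inf
    return 10.0 * math.log10(peak ** 2 / mse)


if __name__ == "__main__":
    src = [[10, 20], [30, 40]]
    dec = [[12, 18], [30, 41]]
    print(round(psnr(src, dec), 2))
```

The measure depends only on pixel-wise differences, which is why it cannot reflect motion or spatial-frequency sensitivity.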

However, this measure does not take into account the
psychovisual characteristics of human vision (HVS: Human Visual System).
Indeed, the human eye is more sensitive to certain spatial frequencies and
its perception of the details of objects is strongly linked with their relative
motion and also with the phenomena of luminosity and contrast.

Thus a sequence, which can, according to the traditional quality
estimation procedure, appear as the outcome of good coding and be
assumed to have a good image quality, will not be perceived in this manner
by the observer, given his psychovisual characteristics.

The object of the invention is therefore to allow assessment of the 
quality of images which approximates as closely as possible to the 
perception which the observer will have thereof. 

The influence of the human factor (HVS) is partially taken into
account in traditional coders of MPEG2 type in the perception of the spatial
frequencies of the decoded images by using a weighting matrix on the high 
frequencies of the 8 by 8 image blocks, but absolutely no account is taken as 
regards the perception of the details of objects in motion. 

Most of the studies which measure the disparities between two
images are essentially based on a static analysis of the defects taking no
account of motion, or else on an analysis of the spatial frequencies. 

Additionally, the few prior studies which take the temporal aspect 
into consideration, but take no account of the motion proper, use the 
difference (distortion) between a current macroblock and the macroblock in
the same position in the preceding image.

None of these studies considers the influence of human vision
together with the problems of motion in the image. 

The purpose of the invention is to alleviate the drawbacks of the
prior art.

This purpose is achieved by the fact that the process for
evaluating the quality of images comprises:

a) a step of processing the signal representative of the image so 
as to obtain a processed signal, 

b) a step of constructing on the basis of the signal representative
of the coded image, a signal representative of the field of motion image on
the basis of the source sequence,

c) a step of building a signal representative of the segmenting of 
the field of motion and of storing the image pixels representative of each 
region having a different field of motion at an address defined with respect to
the velocity vectors estimated in the step of constructing the field of motion
making it possible to determine the pixels having different velocity vectors, 

d) a step of determining or of calculating a psychovisual human 
filter to be applied as a function of the estimated velocity of the region, 

e) a step of filtering the processed signal, and 

f) a step of constructing the map of disparities between the
signals representative of the image which are obtained after the filtering step
and the signals representative of the decoded image which are obtained 
after the filtering step. 
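
Step c), in which pixels are stored according to the velocity vector of their region, can be sketched in Python as a grouping of pixel coordinates keyed by velocity. This is a simplified illustration: it assumes one velocity vector per pixel is already available from step b), and it treats each distinct vector as one region. The name `segment_by_velocity` is hypothetical.

```python
from collections import defaultdict


def segment_by_velocity(motion_field):
    """Group pixel coordinates by their estimated velocity vector.

    motion_field: list of rows, each row a list of (vx, vy) tuples.
    Returns a dict mapping each velocity vector to the list of (row, col)
    coordinates that share it, in raster order.
    """
    regions = defaultdict(list)
    for r, row in enumerate(motion_field):
        for c, velocity in enumerate(row):
            regions[tuple(velocity)].append((r, c))
    return dict(regions)


if __name__ == "__main__":
    field = [
        [(0, 0), (0, 0), (2, 1)],
        [(0, 0), (2, 1), (2, 1)],
    ]
    for velocity, pixels in segment_by_velocity(field).items():
        print(velocity, pixels)
```

Using the velocity vector as the dictionary key plays the role of the address "defined with respect to the velocity vectors" in the text; a practical system would likely quantize or cluster the vectors first.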

According to another particular feature, the process for evaluating
the quality of images comprises a step consisting in applying each of the
steps to the source image and to the decoded image. 

According to another particular feature, the process for evaluating 
the quality of coded images comprises a step of frequency decomposition of 
the images (FFT, subband, etc.) which precedes the filtering step and
consists of a weighting by a coefficient deduced from curves taking into
account the estimated velocity and the frequency band considered, so as to 
take account of the relative influence of the velocity and of the spatial 
frequency on the perception of the moving images. 

According to another particular feature, the psychovisual filtering
step is applied to matrices representative of the inter-pyramid differences
between the Laplace pyramids of the processed source images and those of
the processed decoded images after weighting by, on the one hand, the
local influence representative of the frequency of the pixel concerned and,
on the other hand, a filtering coefficient deduced from filtering curves taking
into account the estimated velocity and the frequency band corresponding to
the level of the Laplace pyramid to which the pixel belongs in a
multiresolution pyramid obtained by constructing a pyramid on the basis of
the image of each region of different velocity.

According to another particular feature, the psychovisual filtering
curves are either built from a succession of curves arranged in the form of a
database and stored in the system, possibly with interpolation on the basis of
these curves, or obtained by analytical representation implemented by
calculation means making it possible to calculate each curve.

According to another particular feature, the step of constructing
the map of disparities is performed by recomposing the filtered
multiresolution pyramids obtained in the preceding step.

According to another particular feature, the step of processing the
image comprises a step of decomposing the source and decoded images
into a Laplace pyramid of n levels and a step of constructing the
inter-pyramid difference.

According to another particular feature, the velocity or local value
of the motion is obtained by possible construction of filters followed by
application of the filter constructed or by application of a median filter.

According to another particular feature, the process comprises a
step of precorrecting the images by performing a Gamma correction and a
correction by Weber's law.

According to another particular feature, the Gamma correction is
performed by a device implementing the following formula:

$$Y = K_s V^{\gamma}$$

with

$$V = k_a E^{\gamma_a}$$

in which Y is the luminance, V the luminance voltage, E the illumination of
the analysed image, γ is an exponent of around 2.2 for black and white
picture tubes and γa has a value of 0.45 commonly agreed for colour
television.

According to another particular feature, the filtering is obtained by
constructing the psychovisual filter corresponding to the velocity estimated
on the basis of a database of filters and interpolation between the two filters
corresponding to the regions closest to the region whose velocity has been
estimated.

According to another particular feature, the relative local influence
(In) of the pixel pij concerned is obtained by calculating a value En
representing the q-th power of the inter-pyramid level-to-level difference
between the source pyramids and decoded pyramids of like level of the pixel
concerned.

According to another particular feature, the calculation of In is 
performed by using the following formula: 

/ = ^" 
k{rt 

with $E_n = (\mathrm{Diff}_n(p_{ij}))^q$,
$m(E_k) = E_k$ if $E_k > S$
and $m(E_k) = S$ if $E_k < S$

with for example S = 0.5% (maximum possible value of Ek). 

According to another particular feature, the filtering comprises a 
directional filtering of the images in a determined direction rather than in 
another. 

According to another particular feature, the Gamma correction is
performed by a calculation device implementing the following equation:

$$L_{display} = L_{max} \left( \frac{e}{e_{max}} \right)^{\gamma}$$

e being the grid level value of the pixel, emax being the maximum value, for
example 256 if the coding is performed on 8 bits, and Lmax being the intensity
corresponding to emax in cd/m².

According to another particular feature, Weber's law is
implemented by a calculation device which carries out the following function:

$$V_{out} = \frac{L_{max}}{2} \log_{10}\left( 1 + 100 \frac{L_{display}}{L_{max}} \right)$$

According to another particular feature, the calculation of the
psychovisual filter is obtained by a device implementing the following
formula:

$$G(\alpha, v) = \left[ 6.1 + 7.3 \left| \log(v/3) \right|^{3} \right] \times v \alpha^{2} \exp\left[ -2\alpha(v + 2)/45.9 \right]$$

with α = 2πf, f = spatial frequency, v = velocity.

Another purpose of the invention is to propose a use of the
process according to the invention.

This other purpose is achieved by the fact that the process of the
invention is used in a coding device, by a dynamic retroaction, as a function
of the psychovisual disparities calculated by the calculation device
implementing the process, on one of the parameters used by the coding
device in the course of the coding.

According to another particular feature, the calculated disparities 
are compared with a threshold so as to modify the coding parameters of the 
coding apparatus until the desired threshold is overstepped. 

According to another particular feature, one of the coding
parameters is either the quantization interval, or the size of the images, or
the form of the group of pictures (GOP).
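The threshold comparison described in the features above can be pictured as a simple feedback loop. The following Python sketch is only an illustration under stated assumptions: `measure_disparity` is a hypothetical callback standing in for the coder plus the evaluation process, the controlled parameter is taken to be the quantization interval, and a finer interval is assumed to lower the measured disparity.

```python
def tune_quantization(measure_disparity, q_start, q_min, threshold, step=1):
    """Lower the quantization interval until the disparity meets the threshold.

    measure_disparity(q) is a hypothetical callback: it is assumed to code
    the sequence with quantization interval q, run the evaluation process
    and return one disparity figure (for example the mean of the map).
    """
    q = q_start
    while q > q_min and measure_disparity(q) > threshold:
        q -= step  # assumption: a finer quantization lowers the disparity
    return q


# Toy stand-in for the coder plus evaluator: disparity grows with q.
chosen = tune_quantization(lambda q: 0.5 * q, q_start=20, q_min=1, threshold=4.0)
print(chosen)
```

The loop stops either when the disparity no longer exceeds the threshold or when the lower bound `q_min` is reached, so it always terminates.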

According to another particular feature, the process of the
invention is used in the analysis of the homogeneity of the calculated
disparities so as to act on the coding parameters.

According to another particular feature, the process of the
invention is used to modify the coding parameters of the different objects of
an image whose coding is object oriented as a function of a constant desired
disparity.

According to another particular feature, the process of the
invention is used to perform a dynamic reallocation of the allocated bit rates.

A last purpose of the invention is to propose a device
implementing the process.

This purpose is achieved by the fact that the evaluating device
comprises:

- a means (1a, 1b) of processing the signal representative of the
source image (10a) and of the decoded image (10b) so as to obtain a
processed source image signal and a processed decoded image signal,

- means (2a, 2b) of constructing, on the basis of the signal
representative of each of the images, a signal representative of the
estimating of the field of motion on the basis of each of the images of the
source and decoded sequences,

- means (3a, 3b) of building a signal representative of the
segmenting of the field of motion and of storing the image pixels
representative of each region Ri having a different field of motion at an
address defined with respect to the velocity vectors estimated in the step of
constructing the field of motion, making it possible to determine for each of
the source and decoded images those having different velocity vectors,

- a means (4, 5) of determining or of calculating a psychovisual
human filter to be applied as a function of the estimated velocity of the
region,

- means (6a, 6b) of filtering applied to each of the processed
source images and processed decoded images, and

- a means (7) of constructing the map of disparities between the
signals representative of the processed source image which are obtained
after the filtering step and the signals representative of the processed
decoded image which are obtained after the filtering step.

According to another particular feature, the psychovisual filtering
means are applied to matrices representative of the inter-pyramid
differences calculated by calculation means between the Laplace pyramids
of the processed source images and those of the processed decoded
images after weighting by, on the one hand, the local influence
representative of the frequency of the pixel concerned and, on the other
hand, a filtering coefficient deduced from stored or calculated filtering curves
and taking into account the estimated velocity and the frequency band
corresponding to the level of the Laplace pyramid to which the pixel belongs
in a multiresolution pyramid obtained by means of constructing this
multiresolution pyramid on the basis of the image of each region of different
velocity.

According to another particular feature, the means of constructing
the map of disparities perform a recomposition of the filtered multiresolution
pyramids.

According to another particular feature, the means of processing,
the means of building, the means of determining, the means of constructing
and the means of filtering consist of at least one microprocessor associated with
memories sufficient to contain the programs making it possible to embody
the various means and to contain the databases and the intermediate
information necessary for the calculation and for obtaining the map of
disparities.

Other particular features and advantages of the present invention
will become more clearly apparent on reading the description given
hereinbelow with reference to the appended figures in which:

- Figure 1a represents a schematic view of the steps of a first
variant of the process,

- Figure 1b represents a graphic representation of the result of the
preprocessing steps,

- Figure 1c represents a simplified view of the source image
matrix,

- Figure 1d represents a simplified view of the matrix obtained
after segmenting the field of motion,

- Figure 2 represents a schematic view of the various steps of a
second variant of the process,

- Figure 3 represents the family of filtering curves corresponding
to the psychovisual influence of human vision, which curves are stored in a
database for determined velocities,

- Figure 4 represents the multiresolution pyramid,

- Figure 5 represents the Laplace pyramid.

A first variant embodiment of the invention will be explained with
the aid of Figure 1a, in which the steps carried out by the device allowing the
evaluation of the quality of the images at the output of a coding procedure
are obtained through various devices which process, on the one hand, the
signals from a source image (10a) and, on the other hand, the signals
representative of a decoded image (10b). Each image is represented by a
plurality of pixels pij arranged as a matrix, as represented in Figure 1c, and
whose size depends on the definition desired for the image. To a given pixel
pij there corresponds a size of characteristic details, which is expressed as a
function of the size of the matrix defining the number of pixels in the image,
on the one hand by the frequency in cycles and on the other hand by the
velocity in degrees per second. Each of the steps of the process according
to the first variant embodiment is applied both to the source image and to the
decoded image. The expression "decoded image" should be understood to
mean any video image obtained at the output of a coding/decoding device
allowing transmission according to a standard such as, for example, MPEG.
To fix matters better, the reader may also refer to appendix 1, which
represents the various steps numbered from 1 to 7 of the process
implemented according to the first variant.

A first device (1a, 1b) for preprocessing source images (10a) and
decoded images (10b) makes it possible to implement a first step of
processing, termed the preprocessing step, carrying out a Gamma correction
of the signals representative of the image and a correction of the contrast by
a Weber law. Weber's law takes account of the fact that the eye is sensitive
to contrasts and that when gazing at a light spot of intensity I + dI on a
background having a luminous intensity I, the ratio dI/I, called Weber's ratio,
is practically constant at around 2% over a wide luminous threshold range,
except for the very low luminous intensities and the very high luminous
intensities. The correction of contrast takes into account a form of saturation
of the eye linked with the fact that, for example, a zone of average intensity
alongside a zone of high intensity will be less well distinguished than a zone
of low intensity alongside a zone of average intensity.



To take this effect into account, the voltage signal which will
control the luminous intensity of the display performed by the cathode-ray
tube is corrected by a so-called Weber law expressed hereinbelow:

$$V_{out} = \frac{L_{max}}{2} \log_{10}\left( 1 + 100 \frac{L_{display}}{L_{max}} \right)$$

in which Lmax represents the maximum luminous intensity, approximately
equal to one hundred candela per square metre (Lmax ≈ 100 cd/m²), and
Ldisplay the desired luminous intensity.

This mathematical law is implemented by an electronic device
making it possible to perform these calculations. By way of example, such a
device can be built from a microprocessor associated with memories which
contain the program corresponding to the calculation algorithm.

For its part, the Gamma correction makes it possible to obviate
the response of the television, that is to say the characteristics of the tube
allowing display. Indeed, cathode-ray tube display devices are nonlinear
devices and the light intensity reproduced on the screen of a cathode-ray
monitor is not, in the absence of correction, proportional to its input voltage.
Gamma correction is a procedure for compensating for this nonlinearity so
as to obtain correct and proportional reproduction of the corresponding
luminous intensity at the input voltage. The image from a screen is
subdivided into pixels organized as a matrix in which the position of the pixel
pij is defined by the indices i and j of the matrix. The value pij of the pixel is
representative of the desired intensity. To correct the phenomena linked with
the cathode-ray tube, a correction law which corresponds to the following
equation:

$$L_{display} = L_{max} \left( \frac{e}{e_{max}} \right)^{\gamma}$$

is applied to this value representative of the voltage intended to obtain the
desired luminous intensity, Gamma (γ) having a value of between 2.3 and
2.6 depending on the particular features of the cathode-ray tube. In this
formula, e is the grid level of the value of the pixel pij, emax is the maximum
possible value of e, for example 256 if the control signals are expressed on
8 bits, and Lmax is the intensity corresponding to emax in cd/m², Lmax being
approximately equal to 100 cd/m².
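As an illustration, the display law can be evaluated numerically. The sketch below assumes the law has the form L_display = L_max (e / e_max)^γ, with e_max = 256 and L_max = 100 cd/m² as quoted in the text; the default γ = 2.4 is merely a value picked inside the stated 2.3 to 2.6 range, and the function name is hypothetical.

```python
def display_luminance(e, gamma=2.4, e_max=256.0, l_max=100.0):
    """Luminance in cd/m^2 displayed for a level e: l_max * (e / e_max) ** gamma."""
    if not 0 <= e <= e_max:
        raise ValueError("level out of range")
    return l_max * (e / e_max) ** gamma


# Half of the maximum level yields far less than half of the luminance,
# which is the nonlinearity the Gamma correction compensates for.
for level in (0, 64, 128, 256):
    print(level, round(display_luminance(level), 2))
```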

Another formulation of the Gamma law can be as follows:

$$Y = K_s V^{\gamma}$$

with

$$V = k_a E^{\gamma_a}$$

in which Y is the luminance, V the luminance voltage, E the illumination of
the analysed image, γ is an exponent of around 2.2 for black and white
picture tubes and γa corresponds to a value of 0.45 commonly agreed for
colour television, Ks and ka being proportionality coefficients.

This Gamma correction and Weber operation makes it possible to
transform the pixel value received as input to the preprocessing circuit (1a)
into a final value P'ij = Ig (Pij)pij.lg which follows the law corresponding to curve
1 represented in Figure 1b. Each of the source images (10a) gives rise to a
plurality of preprocessed pixels p'aij and each of the decoded images (10b)
likewise gives rise to a second plurality of preprocessed pixels p'bij.

In parallel with this processing operation, a second step (2a, 2b,
Fig. 1a) of so-called motion estimation, which allows the construction for each
source and decoded image of the field of motion image, is implemented on
the basis of each image sequence. This construction of the field of motion is
performed between t and t-1 by conventional calculations, such as those
calling upon the method explained in the book by Don Pearson, published by
McGraw-Hill Book Company and entitled "Image Processing", in "The Essex
series in telecommunication and information systems", page 47 et seq. This
estimation of motion over a sequence of images can use either the
differential method, or the method of block matching, or the Fourier method,
or the method of estimation in three-dimensional space. For each image, a
certain number of motion vectors (vi) are thus obtained and the image can
thus be partitioned into regions (R1V1, ..., RiVi, ..., RnVn, Fig. 1d) based on the
motion information; each region (RiVi) is therefore characterized by the fact
that all the pixels of this region have a single associated velocity vector (vi).
This splitting of the image into constant velocity regions constitutes a third
step (3a, 3b) of so-called segmentation of the field of motion, applied to each
source and decoded image. A field of motion is therefore rendered
homogeneous by virtue of the segmentation technique, and the pixels
of nearby motion are grouped into one and the same region. This field of motion
thus segmented is then closer to the true motion of the objects of the scene
represented by the images. This homogenization thus makes it possible to
perform a slight denoising and to have a reduced number of different
motions corresponding to a small number of velocities vi, thereby reducing
the number of filters to be calculated and to be stored in the next step so as
to avoid having (255)² motions to be calculated and to be stored in the
context of an image consisting of 255×255 pixels pij. The estimation of the
motion can also use the technique relying on the extraction of particular
objects in the scene, such as edges or corners of objects, which consists in
following the motion of these particular elements from one image to another.
This provides information regarding motion at different locations of the image
and an interpolation procedure is used to assign motion vectors to the
remaining image areas. One way of measuring the motions of the corners or
edges of elements consists in applying a high pass filter to the image so as
to isolate the edges and thereafter to use the technique relying on the
method of differentials to measure the value of the motion. The formation of
borders can be attenuated with the aid of a low pass filter so as to reduce
the effects of noise and allow the measurement of large motions. A low pass
filter can, for example, be built for a matrix space of dimension 3×3 via the
following matrix:

    1   1   1
    1   1   1
    1   1   1


A high pass filter can, for example, be built for a matrix space of
dimension 3×3 via the following matrix:

    -0.125   -0.125   -0.125
    -0.125    1       -0.125
    -0.125   -0.125   -0.125
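
The two 3×3 matrices above can be applied to an image by a sliding weighted sum. The Python sketch below is a minimal illustration, not the patented implementation: the function name, the replication of border pixels and the 1/9 normalisation of the low pass matrix are assumptions that the text does not specify.

```python
LOW_PASS = [[1, 1, 1], [1, 1, 1], [1, 1, 1]]
HIGH_PASS = [[-0.125, -0.125, -0.125],
             [-0.125, 1.0, -0.125],
             [-0.125, -0.125, -0.125]]


def filter3x3(image, kernel, scale=1.0):
    """Apply a 3x3 kernel to a 2-D list; border pixels are replicated (assumption)."""
    rows, cols = len(image), len(image[0])
    out = [[0.0] * cols for _ in range(rows)]
    for i in range(rows):
        for j in range(cols):
            acc = 0.0
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    ii = min(max(i + di, 0), rows - 1)
                    jj = min(max(j + dj, 0), cols - 1)
                    acc += kernel[di + 1][dj + 1] * image[ii][jj]
            out[i][j] = acc * scale
    return out


flat = [[10.0] * 4 for _ in range(4)]
smoothed = filter3x3(flat, LOW_PASS, scale=1 / 9)  # 1/9 normalisation is an assumption
edges = filter3x3(flat, HIGH_PASS)                 # coefficients sum to zero
```

On a uniform image the low pass output keeps the input level while the high pass output vanishes, which is the behaviour expected of an edge-isolating filter.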
Thus, a table giving for an estimated velocity v the pixels of the
image which belong to this region having this estimated velocity v in cycles
per degree is stored in the form of the matrix of Figure 1d in the storage
means of the device for segmenting the field of motion with a view to the
subsequent use thereof.
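
One possible in-memory form of such a table is a mapping from each velocity vector to the pixels that carry it. The sketch below is illustrative only: it assumes a segmented motion field in which every pixel already holds a single velocity vector, stored as a tuple, and the dictionary layout is a hypothetical stand-in for the matrix of Figure 1d.

```python
from collections import defaultdict


def build_region_table(motion_field):
    """Map each velocity vector to the list of (i, j) pixels that carry it."""
    table = defaultdict(list)
    for i, row in enumerate(motion_field):
        for j, velocity in enumerate(row):
            table[velocity].append((i, j))
    return dict(table)


# Toy segmented field with two constant-velocity regions.
field = [[(0, 0), (0, 0), (2, 1)],
         [(0, 0), (2, 1), (2, 1)]]
regions = build_region_table(field)
```

With such a table, only one psychovisual filter per distinct velocity needs to be built, rather than one per pixel.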

The procedure is continued via a fourth step implemented in
parallel for each of the source (10a) and decoded (10b) images, consisting in
constructing a psychovisual filter (4a, 4b) for each. This fourth step is
performed, for example, using a database of filter curves comprising a
plurality of curves such as those represented in Figure 3, expressing the
influence of the human factor H as a function of the velocity of the motion vi
and of the spatial frequency fi. To these values vi, fi there corresponds a
filtering value H. If the velocity vi lies between the velocities v1 and v2 of two
curves (Hv1, Hv2), the device performs an interpolation so as to determine
the corresponding value H. This interpolation can, for example, be linear.
This step can also be performed by direct calculation of the value H for a
given velocity on the basis of an analytical model of the psychovisual
influence. The analytical model of the filter can, for example, be represented
by the following formula:

$$G(\alpha, v) = \left[ 6.1 + 7.3 \left| \log(v/3) \right|^{3} \right] \times v \alpha^{2} \exp\left[ -2\alpha(v + 2)/45.9 \right]$$

with α = 2πf.
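
Both ways of obtaining H can be sketched in a few lines of Python. The sketch rests on assumptions that should be checked against the original formula: the logarithm is taken as base 10, the exponent on the logarithm term is taken as 3 and the exponent on α as 2; the function names are hypothetical.

```python
import math


def analytic_gain(f, v):
    """G(alpha, v) with alpha = 2*pi*f; base-10 logarithm assumed, v must be > 0."""
    alpha = 2.0 * math.pi * f
    k = 6.1 + 7.3 * abs(math.log10(v / 3.0)) ** 3
    return k * v * alpha ** 2 * math.exp(-2.0 * alpha * (v + 2.0) / 45.9)


def interpolate_gain(v, v1, h1, v2, h2):
    """Linear interpolation of H at velocity v between curves stored for v1 and v2."""
    t = (v - v1) / (v2 - v1)
    return h1 + t * (h2 - h1)


# Database route: two stored curves, sampled here at one spatial frequency.
f = 2.0
h1, h2 = analytic_gain(f, 1.0), analytic_gain(f, 4.0)
h = interpolate_gain(2.5, 1.0, h1, 4.0, h2)
```

Here the stored curve values are generated from the analytic model purely for convenience; in a database-driven device they would be read from the recorded curves.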

In the fifth step, the device synthesizes a filtering value h(s,v) in
the spatial domain for each filtering value H(fs,v) in the frequency domain
and associated with a velocity v and a frequency fs by applying an inverse
fast Fourier transform (FFT⁻¹) to the filtering values H(fs,v) in the frequency
domain, this being expressed by the expression h(s,v) = FFT⁻¹[H(fs,v)], where
fs represents the characteristic size of the details in cycles per degree, v
their motion expressed in degrees per second and s the resolution.
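
The passage from frequency-domain gains to a spatial filter can be illustrated in one dimension. The sketch below uses a plain inverse discrete Fourier transform in place of a fast algorithm (both give the same values), and the sample gains are hypothetical; a real filter would be two-dimensional.

```python
import cmath


def inverse_dft(spectrum):
    """Inverse discrete Fourier transform of a 1-D list of frequency samples."""
    n = len(spectrum)
    return [
        sum(spectrum[k] * cmath.exp(2j * cmath.pi * k * s / n) for k in range(n)) / n
        for s in range(n)
    ]


# Hypothetical symmetric gains H(fs, v) for one velocity v.
H = [1.0, 0.5, 0.0, 0.5]
h = [c.real for c in inverse_dft(H)]  # symmetric spectrum gives real coefficients
```

The resulting list h holds the spatial-domain coefficients that would be applied to the preprocessed pixels of the corresponding velocity region.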

The value h thus determined of the spatial domain for each
source image (5a) and decoded image (5b) in the fifth step will be applied in
the course of a sixth step (6a, respectively 6b) to each of the preprocessed
pixels p'aij of the source image emanating from the preprocessing and of the
decoded image p'bij emanating from the preprocessing.

On each occasion, this sixth filtering step (6a, 6b) results in a pair
of filtered pixel values p'afij and p'bfij which is thereafter used in a last step (7)
of constructing a matrix of disparities (Disp) by calculating the quadratic error
between each pair of pixel values:

$$\mathrm{Disp} = (p'_{afij} - p'_{bfij})^{n} \quad \text{with } n = 2 \text{ or other}$$

This matrix will thus give an objective assessment of the
distortions perceptible to the human eye which are introduced by the
coding/decoding procedure which one wishes to estimate.
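
The last step of the first variant amounts to an element-wise computation, sketched below. The function name is hypothetical, and taking the absolute value before raising to the power n is an assumption made so that an odd n still yields a non-negative error; for n = 2 it changes nothing.

```python
def disparity_map(filtered_source, filtered_decoded, n=2):
    """Element-wise |p'af - p'bf| ** n over two equally sized 2-D lists."""
    return [
        [abs(a - b) ** n for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(filtered_source, filtered_decoded)
    ]


source = [[1, 2], [3, 4]]
decoded = [[1, 0], [5, 4]]
disp = disparity_map(source, decoded)
```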

The second variant embodiment of the invention will now be
explained in conjunction with Figure 2, which represents the various means
allowing the implementation of the process according to this second variant.
To fix matters better, the reader may also refer to appendix 2, which
represents the various steps numbered from 1 to 11 of the process
implemented according to the second variant and which also represents the
decrease in size obtained through the operations of constructing the Laplace
pyramids.

In this variant, the first four steps of the first variant are applied to
the source image (10a), namely the precorrection (1a), the construction of
motion fields (2a) and the segmentation (3a) as well as the construction of
the psychovisual filter (4a). The pixels paij originating from the processing of
the source image (10a) and resulting from the preprocessing step and the
pixels pbij resulting from the step of preprocessing the decoded image (10b)
are each subjected in a step (5.1a and respectively 5.1b) to a 1/2 decimation
filtering (F1/2). This filtering is a low pass filtering which makes it possible, on
the basis of a matrix of pixels representing an image Pn-1 of a given level n-1,
to obtain the image Pn of next level n. This is expressed by the relation:

$$P_n = F_{1/2}(P_{n-1}) \quad \text{with } n > 0$$

P0 being the original image.

By way of example, the decimation filter can be built for a 3×3
matrix space via the following matrix:

    1   2   1
    2   4   2
    1   2   1



This operation of 1/2 decimation filtering by a calculation device
has the result of reducing a matrix of pixels of size m×n representing the
source image Ps0 to a level 1 matrix Ps1 of size m/2 × n/2, the matrix Psn of
level n being of size m/2^n × n/2^n. Likewise, this 1/2 decimation filtering
operation has the result of reducing a matrix of pixels of size m×n
representing the decoded or corrupted image Pd0 to a level 1 matrix Pd1 of
size m/2 × n/2, the matrix Pdn of level n being of size m/2^n × n/2^n. Therefore,
for each source and decoded image, the calculation device stores the level n
and the next level n + 1 in its memory.

Thereafter, in the next step (5.2a, respectively 5.2b), the
calculation device deducts from each image Pn of level n the image Pn+1 of
immediately succeeding level, expanded by 2 (E2), so as to obtain a
succession of matrices constituting what is
referred to appropriately as a Laplace pyramid Ln, according to the formula:

$$L_n = P_n - E_2(P_{n+1}) \quad \text{for } n < N$$

with

$$L_N = P_N$$

The expansion operation performed by E2 consists in interpolating
the image Pn+1 (of size m/2 × n/2) to obtain an image of size m×n.

This expansion or interpolation operation involves several
interpolation matrices as a function of the position of the pixel to be
interpolated.
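
The decimation and subtraction steps can be sketched as follows, assuming the relation Ln = Pn - E2(Pn+1) with the top level kept as is. This is a simplified illustration: the 1/16 scaling of the 3×3 matrix, the replication of border pixels and the use of pixel replication for E2 (instead of position-dependent interpolation matrices) are assumptions, and all function names are hypothetical.

```python
KERNEL = [[1, 2, 1], [2, 4, 2], [1, 2, 1]]  # weights from the text; 1/16 scale assumed


def reduce2(img):
    """F1/2: 3x3 low pass (replicated borders) followed by 2:1 subsampling."""
    rows, cols = len(img), len(img[0])
    out = []
    for i in range(0, rows, 2):
        line = []
        for j in range(0, cols, 2):
            acc = 0.0
            for di in (-1, 0, 1):
                for dj in (-1, 0, 1):
                    ii = min(max(i + di, 0), rows - 1)
                    jj = min(max(j + dj, 0), cols - 1)
                    acc += KERNEL[di + 1][dj + 1] * img[ii][jj]
            line.append(acc / 16.0)
        out.append(line)
    return out


def expand2(img, rows, cols):
    """E2: pixel replication, a crude stand-in for the interpolation matrices."""
    return [[img[i // 2][j // 2] for j in range(cols)] for i in range(rows)]


def laplace_pyramid(img, levels):
    """Return [L0, ..., LN] with Ln = Pn - E2(Pn+1) and LN = PN."""
    gauss = [img]
    for _ in range(levels):
        gauss.append(reduce2(gauss[-1]))
    pyramid = []
    for n in range(levels):
        p = gauss[n]
        up = expand2(gauss[n + 1], len(p), len(p[0]))
        pyramid.append([[a - b for a, b in zip(ra, rb)] for ra, rb in zip(p, up)])
    pyramid.append(gauss[-1])
    return pyramid


image = [[float(4 * i + j) for j in range(4)] for i in range(4)]
pyr = laplace_pyramid(image, 2)  # levels of size 4x4, 2x2 and 1x1
```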

This makes it possible to build, for the source image, a pyramid
LSn of stored matrices and, for the decoded or corrupted image, a second
pyramid LDn of stored matrices. Depending on the choice of the filter F1/2, the
image LSn obtained at the end of the above step is a good approximation of
the energy included within a frequency band centred around the value
fn = 1/(n + 1). For further details regarding Laplace pyramids or so-called
Gaussian pyramids and regarding expansion matrices, the reader may refer
to the article "The Laplacian Pyramid as a Compact Image Code" by
P.J. Burt and E.H. Adelson, published in the journal IEEE Transactions on
Communications, vol. COM-31, No. 4, April 1983, pages 532 to 540.

The Laplace pyramids (LSn, LDn) are obtained via steps (5.2a
and 5.2b).

In the next step, the device constructs, on the basis of the regions
(RiVi) characterized by the same velocity vector (vi) and defined at the output
of the motion field segmentation step (3) applied to the source image, a
multiresolution pyramid Rn of the region image, starting from the original
region image R0 and by applying to this original region image R0 a
1/2 decimation median filter G1/2 according to the formula:

$$R_n = G_{1/2}(R_{n-1})$$

R0 being the original region image.

For further teaching regarding the building of a median filter
known to the person skilled in the art, the reader may refer to chapter 4 of
the book entitled "Nonlinear Digital Filters: Principles and Applications"
by I. Pitas and A.N. Venetsanopoulos, published in 1990 by Kluwer Academic
Publishers.

The (multiresolution) motion pyramid is produced only for the
source image (10a), and the values of the pixels of LSn and LDn represent the
energy which there is in a frequency band, whereas via Rn one has the
motion.

In this step, the calculation device does not perform the
calculation corresponding to step 5.2, that is to say the subtraction of the
image Rn corresponding to each level n from the image Rn+1 of the
immediately succeeding level n + 1 expanded by 2. At this calculation step,
the matrices constituting the multiresolution pyramid are stored and make it
possible to obtain for each pixel of each level n of Rn a local value of the
motion. The median filter of a window of n×m pixels is obtained by ranking
the values of the pixels and by retaining the pixel having the median value.

The application of the median filter G1/2 to the matrix of pixels Rn
of dimension m×n makes it possible to obtain a matrix Rn+1 of dimension
m/2 × n/2. The decimation operation is included in that of the median filter.
The median filter has the same effect on the images Rn as the filter F1/2 on
the images Pn: it reduces their size by 2 horizontally and vertically, except
that instead of being a conventional matrix filter, this is a "median" filter, that
is to say one based on an analysis of the local statistics.
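
A median filter with built-in decimation can be sketched as below. The sketch assumes that each pixel of the region image carries a scalar velocity label, that the window is 3×3 and that border pixels are replicated; none of these choices is fixed by the text, and the function name is hypothetical.

```python
def median_decimate(region, window=3):
    """G1/2: median over a window x window neighbourhood, keeping every second pixel."""
    rows, cols = len(region), len(region[0])
    half = window // 2
    out = []
    for i in range(0, rows, 2):
        line = []
        for j in range(0, cols, 2):
            values = sorted(
                region[min(max(i + di, 0), rows - 1)][min(max(j + dj, 0), cols - 1)]
                for di in range(-half, half + 1)
                for dj in range(-half, half + 1)
            )
            line.append(values[len(values) // 2])  # rank the values, keep the middle one
        out.append(line)
    return out


velocities = [[1, 1, 1, 4],
              [1, 9, 1, 4],
              [1, 1, 4, 4],
              [1, 1, 4, 4]]
coarse = median_decimate(velocities)
```

Unlike a weighted average, the median never creates a velocity value that is absent from the neighbourhood, and the isolated value 9 is discarded as an outlier.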

The Laplace pyramids (LSn, LDn) calculated in step 5.2 are
thereafter used in a step, represented in step 7, of calculating the
level-to-level inter-pyramid differences according to the formula:

$$\mathrm{Diff}_n = LS_n - LD_n$$

This makes it possible to obtain matrices Diffn, each coefficient of
which expresses the values of the differences of the coefficients of the
matrices of the source Laplace pyramid (LSn) and decoded Laplace pyramid
(LDn) for the same level n and to do so for each level from 0 to n. In the
Laplace pyramid LSn the value of the pixel represents the energy which there
is in a frequency band. The frequency disparity between the two images is
obtained for a given frequency band by taking the inter-pyramid difference
LSn - LDn.

This result is in fact weighted by the sensitivity of the eye for this
frequency, which is expressed by the relation of the relative influence of the
activity of the frequency fn. This relative influence of the activity of the
frequency fn can be masked by a large activity in the higher frequencies. To
determine and take into account this relative influence of the activity in a
masking step (8), the calculation device begins by evaluating the local
influence En of a pixel pij, which is defined by the value of the result of the
calculation of the inter-pyramid differences, applied to the pixel pij, this result
being raised to the power q:

$$E_n = (\mathrm{Diff}_n(p_{ij}))^q$$

This local influence value allows the calculation device to determine a matrix
expressing the relative influence of the activity through a circuit which
implements the following formula:

k{n 

with m(Ek) = Ek if Ek > S 
and m(Ek) = S if Ek < S 

with for example S = 0.5% (maximum possible value of Ek). 

As in the previous variant, the calculation device performs a
filtering step 4 by using a database (BD) containing a plurality of filtering
curves expressing the influence of the human factor on the visual perception
of the images. These filtering curves make it possible, on the basis of the
values of the frequency and of velocity corresponding to a pixel pij, to
determine a weighting coefficient H for this pixel. Thus, for each pixel pij of
the matrix Ln corresponding to a velocity region Rn, the calculation device
determines a value H which will weight the relative influence In. This
weighting step (9) is obtained through a calculation device implementing the
equation:

The implementation of this equation makes it possible to obtain a
pyramid of matrices. When the program of the calculation device selects a
pixel from a matrix of level n of the Laplace pyramid, to this level n there
corresponds a spatial frequency fn and the calculation device is able to
associate that pixel of the image Rn to which there corresponds a velocity
value v. By using databases and curves recorded in these databases, the
calculation device determines either directly, or by interpolation between two
curves, the value of the gain coefficient H. In this second method, one works
directly on objects (Laplace pyramids) which correspond to frequency
quantities. There is therefore no need to switch to the spatial domain, since
all the calculations are carried out in the frequency domain. This value H will
weight the relative influence of the activity (In). This weighting step (9) makes
it possible to obtain a pyramid of matrices to which may be applied an
optional step (10) catering for directional filtering so as to take account of the
psychovisual directions favoured by the human gaze.
Thus, it is possible to filter the images constituted by the matrices
Tn through directional filters favouring one direction with respect to others,
these filters consisting of matrices of coefficients of dimension n×n
corresponding to the dimension of the image factor Tn. An example of a 0°
directional filter matrix is given below for a dimension 5×5.

An example of a 90° directional filter matrix is given below.

An example of a 45° directional filter matrix is given below.

The result of this directional filtering step (10) is sent to a
summator circuit so as, in a step (11), to recompose the multiresolution
pyramids P'n via the equation

    P'_n = E_2(P'_{n+1}) + T_n   (n < N)

with P'_N = T_N.

This therefore yields

    P'_{N-1} = E_2(T_N) + T_{N-1}

The procedure is repeated iteratively to obtain P'_0, which
represents the matrix constituting the map of disparities.
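
The recomposition can be sketched as follows. This is only an illustration:
the expansion operator E2 is implemented here by pixel replication, which is
an assumption (the text does not specify the interpolation kernel), and the
function names expand2 and recompose are hypothetical. Each level is assumed
to be exactly half the size of the preceding one.

```python
def expand2(img):
    """Expand a matrix by 2 in each dimension by pixel replication.

    Stand-in for E2; the actual expansion kernel is not given in the text.
    """
    out = []
    for row in img:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def recompose(T):
    """Recompose the map of disparities P'_0 from the weighted levels.

    T[n] is the matrix T_n (T[0] at full resolution, T[-1] the coarsest).
    Applies P'_N = T_N, then P'_n = E2(P'_{n+1}) + T_n down to n = 0.
    """
    P = T[-1]
    for Tn in reversed(T[:-1]):
        up = expand2(P)
        P = [[u + t for u, t in zip(urow, trow)]
             for urow, trow in zip(up, Tn)]
    return P
```

With two levels, T = [[[1, 1], [1, 1]], [[2]]], the coarse value 2 is spread
over the 2×2 grid and added to T_0.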

These steps of the two variant embodiments of the invention are
implemented with the aid of microprocessor circuits executing the
appropriate programs making it possible to carry out the steps set out earlier.
These circuits also comprise storage means for storing the programs
to be executed, the matrices of pixels or the matrices of regions, the
intermediate results needed for the next calculation step, and
the filters to be applied to the intermediate or final results.

This is used to compare the subjective performance of various
coder apparatuses or again to compare the subjective performance of
various coding algorithms and/or to measure the perception of artefacts due
to image processing. Depending on this performance, the calculation circuit
implementing one of the two variant embodiments of the invention can
modify the coding by, for example, retroacting the measurement of
subjective quality of the coded images thus performed on the global and/or
local parameters of the coding. A global parameter on which the retroaction
may be performed can, for example, be the mean bit rate and a local
parameter can, for example, be the local quantization interval used during
the coding. This retroaction can be dynamic during coding, the errors
retroacting on the local quantization interval, the size of the images, the form
of the GOP (Group of Pictures), etc. The retroaction can also be performed
iteratively in the case of codings for video disc (DVD) or CD-ROM. In this
case, so long as the error is above a threshold and/or is not homogeneous
over the whole image, the retroaction of the calculation circuit operates a
decrease in the severity of the coding parameters globally and/or locally. So
long as the error is below a threshold, the measurement of the subjective
quality of the images performed by the calculation circuit makes it possible to
increase the severity of the coding parameters.
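
The iterative retroaction can be illustrated by a small sketch. It assumes
that the coding "severity" is a single quantization step q_step (a larger
step meaning coarser coding), that the hypothetical callable measure_error
returns the measured psychovisual error for a given step, and that this error
grows with the step. The range 1 to 31 and the names tune and measure_error
are illustrative assumptions, not part of the described device.

```python
def tune(measure_error, q_step, threshold, q_min=1, q_max=31):
    """Adjust a quantization step against an error threshold.

    measure_error(q) is a hypothetical callable returning the measured
    subjective error when coding with step q; it is assumed to increase
    with q.
    """
    # Error above the threshold: decrease the severity of the coding.
    while q_step > q_min and measure_error(q_step) > threshold:
        q_step -= 1
    # Error below the threshold: increase the severity while it stays so.
    while q_step < q_max and measure_error(q_step + 1) <= threshold:
        q_step += 1
    return q_step
```

With a toy error model such as q*q and a threshold of 100, the loop settles
on the largest step whose error does not exceed the threshold, whether it
starts above or below it.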

A penultimate use of the method of estimating or of
measuring the subjective quality of a sequence of images emanating from a
coder can relate to object oriented coding. In this case, the use of the
calculation device, or of one of the methods implemented by this device, makes
it possible to ensure a constant subjective quality of the various objects of
the scene or to ensure a given inter-object relative subjective quality. Lastly,
the estimation process and device can make it possible to modify the
conditions of dynamic reallocation of the bit rates allocated to each of the
channels of a broadcast with statistical multiplexing so as to ensure a given
and homogeneous subjective quality of the programmes broadcast.

Other modifications within the scope of the person skilled in the 
art also form part of the spirit of the invention. 

A variant embodiment of the invention consists in using, as signal
representative of the estimation of the field of motion, the per-macroblock
motion vectors emanating from the procedure for coding/decoding the
decoded image, for example during the MPEG-type coding.

Another variant embodiment of the invention consists in replacing
the decoded image by a noisy source image. The latter can be constructed
for example on the basis of the source image to which white noise is added
(random variable uniform in all the spatial frequency bands). The disparity
maps obtained can then be regarded as a prediction of the zones of the
image where the coding errors will be most perceptible "a priori", that is to
say before having performed the coding proper.
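
A noisy source image of this kind could be built as in the following sketch.
Independent Gaussian samples are used as the white noise, which is an
assumption (the text only requires noise spread over all spatial frequency
bands); the standard deviation, the seed and the 8-bit clipping range are
likewise illustrative, and add_white_noise is a hypothetical name.

```python
import random


def add_white_noise(image, sigma=2.0, seed=0, max_level=255):
    """Return a copy of `image` with independent Gaussian noise added.

    image: list of rows of integer grey levels. The result is rounded and
    clipped to [0, max_level]. Gaussian noise and the default parameters
    are assumptions made for illustration.
    """
    rng = random.Random(seed)  # seeded so the result is reproducible
    return [[min(max_level, max(0, round(p + rng.gauss(0.0, sigma))))
             for p in row]
            for row in image]
```

The noisy image then takes the place of the decoded image in either method to
produce the "a priori" disparity map.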

These disparity maps may then be used in the implementation of
devices for preprocessing the source images for the purpose of preventing
the generation of artefacts or coding defects during a future coding
procedure. A preprocessing device consists for example of a circuit for
prefiltering and/or for reducing the energy in the high frequencies, in the
zones of the image where the visibility of the coding artefacts is lowest,
zones supplied by the disparity maps.

These "a priori" disparity maps can be used to further reduce the 
bit rate necessary for coding in the zones where the visibility of the coding 
artefacts is predicted to be lower "a priori". 

These "a priori" disparity maps can also be used to locally
measure the amount of "hidden" information which can be inserted into the
source or decoded images without being perceptible (Watermarking).

APPENDIX 1
Method 1 

Step 1 Precorrection of images: Gamma Correction of the screen and
Contrast Correction (Weber's law).

Step 2 Construction of the Field of Motion image based on the Source 
Sequence and for each image. 

Step 3 Segmentation of the Field of Motion. For each image, a Segmentation
into Regions is thus available, based on the Motion information.
Each Region(v) is therefore characterized by a velocity vector v.
Each pixel of each image (Source or Decoded) belongs to a
Region corresponding to an estimated velocity v (in cycles per degree).

Step 4 For each Region(v), Construction of the corresponding Psychovisual
Filter, on the basis of a filter database (BDD) {H(fs, vi)}, i = 1, ..., N, and
interpolation of the filters.

Step 5 For each Region(v), Synthesis of the Spatial Filter by inverse FFT:

    h(s, v) = FFT^{-1}[H(fs, v)]

Step 6 Filtering of the Source and Decoded images to obtain two other
images: SourceF and DecodedF.

Each pixel P of the Source/Decoded image is filtered by the Filter 
h(s,v), corresponding to the Region(v) to which P belongs, centred on P and 
applied to the Source/Decoded image. 

Step 7 Construction of the Map of Disparities or psychovisual Errors

    Err = (SourceF - DecodedF)^n   (n = 2, or other)
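
Steps 6 and 7 of Method 1 can be sketched as follows. The sketch assumes that
the spatial filters h(s, v) are already available as small square kernels of
odd size, indexed by the velocity label of each region; it handles image
borders by replicating edge pixels and applies the kernel without flipping
(equivalent to convolution for symmetric kernels). These choices, and the
names filter_by_region and disparity_map, are assumptions made for
illustration.

```python
def filter_by_region(image, regions, kernels):
    """Filter each pixel with the kernel of the region it belongs to.

    image:   list of rows of grey levels.
    regions: same shape, regions[y][x] is the velocity label v of the pixel.
    kernels: dict mapping v to a square, odd-sized kernel h(s, v).
    Borders are handled by edge replication (an assumption).
    """
    h, w = len(image), len(image[0])
    out = [[0.0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            k = kernels[regions[y][x]]
            r = len(k) // 2
            acc = 0.0
            for dy in range(-r, r + 1):
                for dx in range(-r, r + 1):
                    yy = min(max(y + dy, 0), h - 1)
                    xx = min(max(x + dx, 0), w - 1)
                    acc += k[dy + r][dx + r] * image[yy][xx]
            out[y][x] = acc
    return out


def disparity_map(source_f, decoded_f, n=2):
    """Err = (SourceF - DecodedF)^n, computed pixel by pixel."""
    return [[(s - d) ** n for s, d in zip(srow, drow)]
            for srow, drow in zip(source_f, decoded_f)]
```

Both the Source and the Decoded image are passed through filter_by_region
with the same regions and kernels before disparity_map is applied.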
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APPENDIX 2 
Method 2 

Step 1 (see Method 1).

Step 2 (see Method 1).

Step 3 (see Method 1).

Step 4 (see Method 1).

Step 5 Decomposition of the Source and Decoded images into a Laplace
Pyramid of N levels constructed in two steps:

• Each level Pn is firstly obtained by 1/2 Decimation-Filtering
(low-pass) of the immediately preceding level (multiresolution
pyramid, Figure 4).

    P_n = F_{1/2}(P_{n-1})   n > 0
    P_0 = original image

• Then from each level Pn is deducted the immediately
succeeding level expanded by 2 so as to obtain Ln (Laplace
pyramid, Figure 5).

    L_n = P_n - E_2(P_{n+1})   n < N
    L_N = P_N

This calculation makes it possible to obtain a representation of a
multiresolution pyramid Pn in accordance with Figure 4 and a representation
of a Laplace pyramid Ln in accordance with Figure 5.

If the Filter F1/2 is well chosen, the image Ln is a good
approximation of the energy included within a frequency band centred
around

    f_n = 1/(n+1).

Finally, two Laplace Pyramids are available: LSn (Source) and LDn
(Decoded).
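
The two-step construction of Step 5 can be sketched as below. The filter
F1/2 is replaced here by a plain 2×2 block average and the expansion E2 by
pixel replication; both are crude stand-ins chosen for brevity, not the
filters of the application. The image size is assumed divisible by
2**levels, and the names reduce2, expand2 and laplace_pyramid are
hypothetical.

```python
def reduce2(img):
    """Stand-in for F1/2: average each 2x2 block (low-pass + decimation)."""
    return [[(img[y][x] + img[y][x + 1]
              + img[y + 1][x] + img[y + 1][x + 1]) / 4.0
             for x in range(0, len(img[0]), 2)]
            for y in range(0, len(img), 2)]


def expand2(img):
    """Stand-in for E2: expand by 2 in each dimension by replication."""
    out = []
    for row in img:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out


def laplace_pyramid(image, levels):
    """Return (P, L): multiresolution and Laplace pyramids with N = levels.

    P[0] is the original image and P[n] = reduce2(P[n-1]).
    L[n] = P[n] - expand2(P[n+1]) for n < N, and L[N] = P[N].
    """
    P = [image]
    for _ in range(levels):
        P.append(reduce2(P[-1]))
    L = []
    for n in range(levels):
        up = expand2(P[n + 1])
        L.append([[a - b for a, b in zip(ra, rb)]
                  for ra, rb in zip(P[n], up)])
    L.append(P[levels])
    return P, L
```

Calling laplace_pyramid once on the Source image and once on the Decoded
image gives the two pyramids LSn and LDn.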

Step 6 By the same principle, the multiresolution Pyramid Rn of the Regions
image is constructed by replacing F1/2 by G1/2, a 1/2 Decimation/median Filter.
Thus, for each pixel of each level of the Laplace Pyramids (Step 5), the local
value of the motion is available.

Step 7 Calculation of the level-to-level inter-Pyramid Differences:

    Diff_n = LS_n - LD_n

Step 8 Application of the principle of frequency masking (Texture/Masking):
The relative influence of the activity at the frequency fn is masked
by considerable activity in the higher frequencies (fk, k < n).

The relative local influence In(pi) of pixel pi is then defined from
the activity of the higher-frequency levels k < n and from

    E_n = (Diff_n(pi))^q

with q = 2 for example.

Step 9 Filtering of the Source and Decoded Laplace Pyramids.

Each pixel pi of Ln is weighted by the value H(fn,v), corresponding
to the Region(v) to which pi belongs in Rn, and by the relative influence In.

    T_n(pi) = I_n(pi) × H
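
The weighting of Step 9 amounts to a pixel-wise product, sketched below for a
single pyramid level. The relative influence In is taken as an already
computed matrix, and gain_for_velocity is a hypothetical callable standing
for H(fn, v) at that level; neither the name nor the toy gain used in the
usage note comes from the application.

```python
def weight_level(influence, regions, gain_for_velocity):
    """Compute T_n(pi) = I_n(pi) x H(f_n, v) for one pyramid level.

    influence: matrix I_n of relative local influences.
    regions:   matrix R_n giving the velocity v of each pixel.
    gain_for_velocity: hypothetical callable returning H(f_n, v).
    """
    return [[i * gain_for_velocity(v) for i, v in zip(irow, vrow)]
            for irow, vrow in zip(influence, regions)]
```

For instance, with a toy gain 1/(1+v), a pixel of influence 4 lying in a
region of velocity 1 receives the weighted value 2.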

Step 10 Directional Filtering: to take account of the psychovisual directions
favoured by the human gaze, it is possible to filter the images Tn by
directional filters which favour one direction with respect to others.

Step 11 Construction of the Map of Disparities or psychovisual Errors: the
multiresolution pyramids P'n are recomposed:

    P'_n = E_2(P'_{n+1}) + T_n   (n < N)
    P'_N = T_N

The Map of disparities corresponds to P'_0.

CLAIMS

1. Process for evaluating the quality of coded images, characterized
in that it comprises:

a) a step of processing the signal representative of the image so
as to obtain a processed signal,

b) a step of constructing on the basis of the signal representative
of the coded image, a signal representative of the field of motion image on
the basis of the source sequence,

c) a step of building a signal representative of the segmenting of
the field of motion and of storing the image pixels representative of each
region having a different field of motion at an address defined with respect to
the velocity vectors estimated in the step of constructing the field of motion
making it possible to determine the pixels having different velocity vectors,

d) a step of determining or of calculating a psychovisual human
filter to be applied as a function of the estimated velocity of the region,

e) a step of filtering the processed signal, and

f) a step of constructing the map of disparities between the
signals representative of the image which are obtained after the filtering step
and the signals representative of the decoded image which are obtained
after the filtering step.

2. Process for evaluating the quality of coded images according to
Claim 1, characterized in that it comprises a step consisting in applying each
of the preceding steps to the source image and to the decoded image.

3. Process for evaluating the quality of coded images according to
Claim 1, characterized in that it comprises a step of frequency
decomposition of the images (FFT, subband, etc.) which precedes the
filtering step and consists of a weighting by a coefficient deduced from
curves taking into account the estimated velocity and the frequency band
considered, so as to take account of the relative influence of the velocity and
of the spatial frequency on the perception of the moving images.
4. Process according to Claim 1, characterized in that the
psychovisual filtering step is applied to matrices representative of the
inter-pyramid differences between the Laplace pyramids of the processed
source images and those of the processed decoded images after weighting
by, on the one hand, the local influence representative of the frequency of
the pixel concerned and, on the other hand, a filtering coefficient deduced
from filtering curves taking into account the estimated velocity and the
frequency band corresponding to the level of the Laplace pyramid to which
the pixel belongs in a multiresolution pyramid obtained by constructing a
pyramid on the basis of the image of each region of different velocity.
5. Process according to Claim 1, characterized in that the
psychovisual filtering curves are either built from a succession of curves
arranged in the form of a database and stored in the system, and possibly
interpolation on the basis of these curves, or obtained by analytical
representation implemented by calculation means making it possible to
calculate each curve.

6. Process according to Claim 4, characterized in that the step of
constructing the map of disparities is performed by recomposing the filtered
multiresolution pyramids obtained in the preceding step.

7. Process according to Claim 4 or 6, characterized in that the step
of processing the image comprises a step of decomposing the source and
decoded images into a Laplace pyramid of n levels and a step of
constructing the inter-pyramid difference.

8. Process according to Claim 1, characterized in that the velocity or
local value of the motion is obtained by possible construction of filters
followed by application of the filter constructed or by application of a median
filter.

9. Process according to Claim 1 or 4, characterized in that it 
comprises a step of precorrecting the images by performing a Gamma 
correction and a correction by Weber's law. 

10. Process according to Claim 7, characterized in that the Gamma
correction γ is as follows:

    y = K_s V^{γ_s} with V = k_a E^{γ_a}

in which y is the luminance, V the luminance voltage, E the illumination of
the analysed image, γs is an exponent of around 2.2 for black
and white picture tubes and γa has a value of 0.45 commonly agreed for
colour television.

11. Process according to Claim 1, characterized in that the filtering is
obtained by constructing the psychovisual filter corresponding to the velocity
estimated on the basis of a database of filters and interpolation between the
two filters corresponding to the regions closest to the region whose velocity
has been estimated.

12. Process according to Claim 4, characterized in that the relative
local influence (In) of the pixel pi concerned is obtained by calculating a value
En representing the q-th power of the inter-pyramid level-to-level difference
between the source pyramids and decoded pyramids of like level of the pixel
concerned.

13. Process according to Claim 12, characterized in that the
calculation of In is performed by using the following formula:

    (formula for I_n in terms of E_n and of m(E_k), k < n)

with E_n = (Diff_n(pi))^q

    m(E_k) = E_k if E_k > S
and m(E_k) = S if E_k < S

with for example S = 0.5% (maximum possible value of Ek).

14. Process according to Claim 4, characterized in that the filtering
comprises a directional filtering of the images in a determined direction
rather than in another.

15. Process according to Claim 9, characterized in that the Gamma
correction is performed by a calculation device implementing the following
equation:

    L_display = L_max (e / e_max)^γ

e being the grey level value of the pixel, e_max being the maximum value, for
example 256 if the coding is performed on 8 bits, L_max being the intensity
corresponding to e_max in cd/m^2.

16. Process according to Claim 9, characterized in that Weber's law is
implemented by a calculation device which carries out the following function:

17. Process according to Claim 1 or 4, characterized in that the
calculation of the filter is obtained through the following formula:

    G(α, v) = [6.1 + 7.3 |log(v/3)|^3] × v α^2 exp[-2α(v + 2)/45.9]

with α = 2πf, f = spatial frequency, v = velocity.
18. Use of the process according to one of the preceding claims in a
coding device, characterized by a dynamic retroaction as a function of the
psychovisual disparities calculated by the calculation device implementing
the process on one of the parameters used by the coding device in the
course of the coding.

19. Use of the process according to Claim 18, characterized in that
the calculated disparities are compared with a threshold so as to modify the
coding parameters of the coding apparatus until the desired threshold is
overstepped.

20. Use of the process according to Claim 19, characterized in that 
one of the parameters is either the quantization interval, or the size of the 
images, or the form of the group of pictures GOP. 

21. Use of the process according to Claim 18, characterized in that
the homogeneity of the calculated disparities is analysed by the calculation
device so as to act on the coding parameters of the coding apparatus.

22. Use of the process according to the preceding claims,
characterized in that the coding parameters of the different objects of an
image whose coding is object oriented are modified as a function of a
constant desired disparity.

23. Use of the process according to Claims 18 to 22, characterized in 
that it consists in performing a dynamic reallocation of the bit rates allocated 
to a coding apparatus with multiplexing. 

24. Device for evaluating the quality of coded images, characterized
in that it comprises:

- a means (1a, 1b) of processing the signal representative of the
source image (10a) and of the decoded image (10b) so as to obtain a
processed source image signal and a processed decoded image signal,

- means (2a, 2b) of constructing on the basis of the signal
representative of each of the images, a signal representative of the
estimating of the field of motion on the basis of each of the images of the
source and decoded sequences,

- means (3a, 3b) of building a signal representative of the
segmenting of the field of motion and of storing the image pixels
representative of each region R having a different field of motion at an
address defined with respect to the velocity vectors estimated in the step of
constructing the field of motion making it possible to determine for each of
the source and decoded images those having different velocity vectors,

- a means (4, 5) of determining or of calculating a psychovisual
human filter to be applied as a function of the estimated velocity of the
region,

- means (6a, 6b) of filtering applied to each of the processed
source images and processed decoded images and

- a means (7) of constructing the map of disparities between the
signals representative of the processed source image which are obtained
after the filtering step and the signals representative of the processed
decoded image which are obtained after the filtering step.

25. Device according to Claim 24, characterized in that the
psychovisual filtering means are applied to matrices representative of the
inter-pyramid differences calculated by calculation means between the
Laplace pyramids of the processed source images and those of the
processed decoded images after weighting by, on the one hand, the local
influence representative of the frequency of the pixel concerned and, on the
other hand, a filtering coefficient deduced from stored or calculated filtering
curves and taking into account the estimated velocity and the frequency
band corresponding to the level of the Laplace pyramid to which the pixel
belongs in a multiresolution pyramid obtained by means of constructing this
multiresolution pyramid on the basis of the image of each region of different
velocity.

26. Device according to Claim 24, characterized in that the means of
constructing the map of disparities perform a recomposition of the filtered
multiresolution pyramids.

27. Device according to one of Claims 24 to 26, characterized in that
the means of processing, the means of building, the means of determining,
the means of constructing, the means of filtering consist of at least one
microprocessor associated with memories sufficient to contain the programs
making it possible to embody the various means and to contain the
databases and the intermediate information necessary for the calculation
and for obtaining the map of disparities.

28. Process according to Claim 1, the images being coded according
to the MPEG standard, characterized in that the step of constructing a signal
representative of the field of motion image exploits the per-macroblock
motion vectors calculated during the coding of the images according to the
MPEG standard.

29. Process according to Claim 1, characterized in that the decoded
image is a noisy source image constructed on the basis of the source image
to which white noise is added.

30. Use of the process according to Claim 29 for predicting, on the
basis of the map of disparities, the regions most sensitive "a priori" to the
coding errors and for coding the regions as a function of this prediction.

31. Use of the process according to Claim 29 to perform a prefiltering
of the source images as a function of the map of disparities.

32. Use of the process according to Claim 29 for determining locally
the amount of information which can be inserted into the images
(Watermarking) without this addition being perceptible.
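
By way of illustration, the analytical filter formula of Claim 17 can be
evaluated as in the following sketch. The logarithm is taken to be base 10
and the exponent of the logarithmic term to be 3, which are assumptions made
when reading the printed formula; the function name kelly_gain is
hypothetical, and the velocity must be strictly positive for the logarithm
to be defined.

```python
import math


def kelly_gain(f: float, v: float) -> float:
    """Evaluate G(alpha, v) with alpha = 2*pi*f.

    G = [6.1 + 7.3 |log10(v/3)|^3] * v * alpha^2 * exp(-2 alpha (v+2)/45.9)

    f: spatial frequency, v: velocity (v > 0). Base-10 logarithm and the
    cubic exponent are assumptions, as noted above.
    """
    alpha = 2.0 * math.pi * f
    k = 6.1 + 7.3 * abs(math.log10(v / 3.0)) ** 3
    return k * v * alpha ** 2 * math.exp(-2.0 * alpha * (v + 2.0) / 45.9)
```

Sampling this function over a grid of spatial frequencies for each reference
velocity is one way to obtain analytically the filtering curves that would
otherwise be stored in a database.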
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